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ABSTRACT 


In this document, the logic structure of a unlvers..! VLSI chip called the 
symbol-slice Reed-Solomon (RS) encoder chip is presented. An RS encoder can be 
constructed by cascading and properly interconnecting a group of such VLSI chips. 
As a design example, it is shovsm that a (255,223) RS encoder requiring around ^<0 
discrete CMOS IC*s may be replaced by an RS encoder consisting of four identical 
interconnected VLSI RS encoder chips. Besides the size advantage, the VLSI RS 
encoder also has the potential advantages of requiring less power and having a 
higher reliability. 
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SUCTION 1 


INTRODUCTION 

Reed-Solomon (RS) codes (Ref. 1) are nonbinary BCH cades. These 
codes can correct both random and burst errors over a communication channel, Re- 
cently concatenated coding systems using RS codes as the outer codes have been 
proposed for space communication to achieve very low error probabilities (Refs, 

2 to 7). Several deep space flight projects each as the Voyager at Uranus en- 
counter, the Galileo, and the International Solar Polar Mission (ISPM) have also 
considered using the concatenated RS/Viterbl channel coding scheme. Hence RS 
codes are quite important for space communications. 

The complexity of an RS encoder is proportional to the error- 
correcting capability of the code, the speed of the encoding, and the Interleaving 
level used (Ref. A). For reliable space communication there is a need to use RS 
codes with large error-co.rect ing capability and large Interleaving level (Refs. 

A, 5, 8, and 9. Hence one is especially interested in minimizing the complexity 
of RS encoders for space communication applications. In a spacecraft the power, 
size, and reliability requirements are usually quite severe. Thus there is con- 
siderable interest in a VLSI (Very Large Scale Integration) RS encoder which has 
the potential for significant savings in size, weight, and power while at the 
same time providing higher reliability over an RS encoder implemented in discrete 
logic circuits. 

This document Introduces a symbol-sliced logic structure suitable for 
a VLSI implementation of RS encoders. By cascading and properly interconnecting 
a group of such ^7LS1 chips, each consisting of a fixed portion of the encoder, it 
is possible to obtain an RS encoder with any desired error-correcting capability 
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and interleaving level. An a doBlgn example, it is ohovfli that a (255,223) RS 
encoder requiring 40 discrete CMOS IC's may be replaced by an RS encoder conoiot” 
in, of four identical interconnected VLSI encoder chips. It is also shown that 
these VLSI RS encoder chips can be pujalleled to Improve the encoding speed. 
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SECTION II 


BASIC CONCEPTS OF FINITE FIELDS 

A field lo a set of elementjj, inrludinB 0 and 1, any pair of which 
may be added or mulftpiled (denoted by + and *) to p,lve a unique reoult in the 
field. The addition and multiplication are aaooclutive and conuimtat 1 ve , and multi- 
plication dlotributofi over addition in the usual way, i.e., 

u * (v+w) ° u*v+u*w 

Every field element u has a unique negative element -u in the aame field such 
that 


u + (-u) ° 0 

Every nonzero field element u has a unique reciprocal field element 1/u, such tlmt 

u * (1/u) “ 1 


For every field element u 


0 + U'='U‘=‘ l*u 


and 


o * u “ 0 

If the number of elements in a field is Infinite, then it is called 
an infinite field. Examples of infinite fields are: 

(a) The rational number field. 

(b) The real number field. 

(c) The complex number field. 
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If the number of elements in u field io finite, then it in colled a finite field 
or a (lullofj field CF(q^, where q lo the number of elementa in the field. Two 
exampleu of finite fieldo are given oh follov;o. The flrat example lo the finite 
field (!I'(p) which io formed \)y integero modulo p, where p lo a prime. If p ° 2, 
tl\en the field la called GK(2). (IF(2) containa only two elemento, l.e., 0 and 1. 

The addition and multiplication tobleo of GF(2) are given ao below: 


0 

1 


0 1 0 1 

0 
1 


o 

! 

1 

1 

0 


0 

0 

0 

1 


Addition 


Multiplication 


The aocond example is the finite field formed by polynomials modulo 
and irreducible polynomial of degree m with coefficients in GF(p), where p is a 
prime. The definition of an irreducible polynomial is given as follows. A 
polynomial 


« i 

p(x) “ X] ^ 
i°0 

with f GF(p) is called irreducible over GF(p) if there exists no polynomials 
A(x) and B(x) with coefficients from GF(p) such that 

p(x) •= A(x) B(x) 


where 


1 < Degree of A(x) < m-1 
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Aq an example, if p = 2, then the polynoraiala 


and 


+ X 


^ 3 a. 2 ^ , 

+ X + X + I 


4 

X 


+ X + 1 


are Irreducible over GF(2). 

Now the multiplicative structure of finite fields will bo discussed. 

If u field contains an element a, then the least positive integer N for which 
N 

01 “ 1 is called t!ie order of a. In a finite field of q elements, GF(q) , there 
is a primitive element a, i.e., an element of order q - 1. Every nonzero element 
of GF(q) can be expressed as a power of u. 

Next the vector space structure of finite fields will be presented. 

To do It, one needs the following definition of a primitive polynomial. An 
Irreducible polynomial of degree m over GF(q) Is called primitive if it has a 
primitive element of GF(q*^) as a root. A finite field of elements, GF(q"') , 
can be considered as an m-d Imensional vector space over GF(q). A choice for a 
basis of GFCq*”) over GF(q) is the set 

L 2 m-1] 

jl, a, a , , a J 

which is called the canonical basis, where a is a root of a primitive polynomial 
of degree m over GF(q). In vector form 

, 2 m-ll 

1, a, a , , a j 

is represented as 

1 -H. (0, 0, , 0, 0, 1) 
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n 


a -H. (0, 0, , 0. I, 0) 

-H. (0, 0, , 1, 0, 0) 

I 

I 

I 

I 

I 

( 1 , 0 , , 0 , 0 , 0 ) 

whereas 0 ts mapped to (0, 0, , 0). Thus uLl elements in GF(q'”) except 0 can 

be formed by linear combinations of tiie canonical basis 

, 2 m-1 

1. « , . a 

4 4 

As an example, the Galois field of 2 elements, Gl(2 ), may be formed 

4 

as the field of polynomials over GF(2) modules (x + x + 1) , which is a primitive 

4 

polynomial of degree 4. Let a be a root of x + x + 1, i.e., 

4 

a -t- a + 1 “ 0 

4 

Then by the definition of the primitive polynomial, all elements of GF(2 ) except 
0 are powers of a. The representation of for 4<i<lS can be determined by the 

4 

primitive polynomial. In this example a is a root of x + x + 1 over GF(2) . 

Thus 

4 

a + a + 1 = 0 


It follows that 



-1 


4 

a 


a + 1 
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Since -1 = 1 in GF(2) , one has 


i 

The rest of the element a for 5^i<15 can be obtained likewise. The 15 nonzero 

4 

field elements of GF(2 ) are shown below in both multiplicative and vector space 
forms . 


0 

a 








1 

o 

( 

0001 

) 

1 

a 

a 





a 



a 

( 

0010 

) 

2 

a 

o 



2 

a 





a 

( 

0100 

) 

3 

a 

C3 

3 

a 







a 

( 

1000 

) 

4 

a 

O 





a 

+ 

1 

a 

( 

0011 

) 

5 

a"' 

a 



2 

a 

+ 

a 



a 

( 

0110 

) 

6 

a 

a 

3 

a 

"f 

2 

a 





a 

( 

1100 

) 

7 

a 

a 

3 

a 




a 

+ 

1 

a 

( 

1011 

) 

8 

a 

a 



2 

a 



4* 

1 

= 

( 

0101 

) 

9 

a 

a 

3 

a 

+ 



a 



a 

( 

1010 

) 

10 

a 

a 



2 

a 

+ 

a 

4* 

1 

= 

( 

0111 

) 

11 

rt 

= 

3 

u 

■f 

2 

a 

+ 

a 



a 

( 

1110 

) 

12 

a 

a 

3 

a 


2 

a 

+ 

a 

4- 

1 

a 

( 

1111 

) 

13 

a 

a 

3 

a 

+ 

2 

a 





= 

( 

1101 

) 

14 

a 

a 

3 

a 





+ 

1 


( 

1001 

) 

15 

a 

= 







1 

= 

( 

0001 

) = a° 


g 

Another example is the finite field GF(2 ) generated by the primitive 

polynomial 

+ x^ + x^ + + 1 

The exponents versus elements and elements versus exponents tables for 

g 

this GF(2 ) are shown in Table 1 and Table 2, respectively. 
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The uddition of two field tlementH in ()F(2*^) is performed by eomponent- 
wlae. addition in the vector representations of the two elements. For example in 
) , ‘I +01 is ^^iven by 




6 ° (O.O.O.O.O.l, 1 ,0) 

<> 

26 

a 



+ 

11 “ (0.0, 0,0, 1,0, 1 ,1) 


2 38 
u 




n ° ( 0 , 0 , 0 . 0 , -., 1 , 0 . 1 ) 


104 

a 


TIu* multiplication of two 

field elements in CF(2 

) in 

performed by a modulo 

(2"-l) 

addi t ion 

on the exponents 

of the tvi/o elements. 

For example in GF(2^), * 

2JH 

u 

is given 

by 

b - (0,0, 0,0, 0,1, 1,0) 


26 



*) 

11 = (0,0,0,0,1,0, 1,1) 


2 38 

a 




5« - (0,0, 1,1, 1,0, 1,0) 


f26 + 238) MOD 255 
a 
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SECTION III 


REED-SOLOMON ENCODING PROCEDURES 

An RS c'ihIc* word linti symbols, whoro t*iu h symbol luis .) bits. 01 

the (2‘*-l) symbols there are (2‘^-l-2E) Intormatlon symbols and 2E purily-ehei’k 
symbols, where E Is the number ol symbols an RS code is able to correct. If 
one treats the (2"'-l-2E) information symbols as the coefficients of the poly- 
nomial 


f(x) 


2E 


5 + 8 . X-l h 8,.X 

2' -l-2E 2 -2-2E 


2‘’-2-2E 


+ s^x 


2 -1-2E 


where s^ Is the Ith transmitted symbol, then the 2E par if'-check symbols cun be 
obtained as the coefficients of the remainder of 


f (x) /g(x) 


where g(x) is the generator polynomial (Ref. 9) of the code. Usually g(x) is 
defined as 

g(x) ° n (x-a ) “ 

i=l j“o ^ 


where a is a primitive element 
efficients of g(x) with g,,^ =■ 
have symmetrical coefficients, 


of the Galois field GF(2'^1 , and ^^e the co- 

1. The generator polynomial defined above does not 
i.e. , 


8j 82 E_j J “ 0, 1, 2, — , 2E. 


A block diagram of an RS encoder which generates the remainder of 
f(x)/g(x) is given in Fig. 1. The switches in Fig. 1 are normally in the "ON" 
position until the lust information symbol gets into the encoder. At this 
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Figure 1. A Block Diagram of a (2 -1, 2 -1-2E) RS Encoder 
Using a Conventional Generator Polynomial. 

Legend: 0 = Finite Field Adder, 0 = Finite Field Multiplier 

(sH = J-bit Shift Register. 













moment all switches are switched to the "OFF" position and the encoder is be- 
having like a long shift register. The output of the encoder is then taken from 
the output of the last shift register. Note that in Fig. 1, 2E multipliers are 
needed in the encoder. 

To reduce the number of multipliers needed, a special class of the 
generator polynomial which has symmetrical coefficients was proposed by Berlekamp 
(Ref. 10). This generator polynomial is defined as 

2'^"^+E-l 2E 

g(x) = n (x-a ) ” g.x-^ 

I_1 J 


where 


“ ^2E-j ^o “ ^2E ° 

Note that since g == 1 , only E multipliers are needed (see Fig. 2). Thus using 
this new generator polynominal will reduce the number of multipliers required by 
one-half. As an example, the coefficients of all generator polynomials of the form 

g(x) = [^ (x - 

i=112 

for a 16-error-correcting RS code with 8-bit per symbol are shown in Table 3 for 
a = 2, 128, 232, 135, 201, 90, 7A, 119, respectively. 

There are several schemes for interleaving the RS codes (Ref. 5). One 

scheme illustrated in Fig. 3 as "Interleave B" requires memory only for the parity- 

check symbols in the encoder is described as follows. In this scheme the input 

bits are grouped into J-bit symbols and transmitted in their natural order. How- 
L h 

ever every I symbol belongs to the same code word, where I is the interleaving 
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Table 3. Generator Polynomial Coefficients for g(x) 


ORIGINAL PACl * 
OF POOR QUALIl'Y 


O' 

r-i f-(mr'joovOO<j’0'rHiooo'«4’OOfs<notnrs)oO'»TO'OiOr-<0''j’ 
r-4 r^o'inw/'OO'irir^ vtcN<sa3-or"QOr^'3'oorMcs-<f rhino' 

II I — I I — I 1V| iH I — I CS rH r-4 I — I CN rH i-H <N rH cv| r-H r-H 


-<r rHoomcx3f^kOOmoO'4’csr^f^oO'3’'Oirivo-^oor'»r^rM'3'cOf^o 

r~ mmoovOfOcsO'>jOiHr'‘OOi-(OcNOi-^(»'r^r-to<tO'tS(n 

II pH p— ^ p— 4 pH p-H p-^ p-H pH pp 4 <N pH pH pH pH pH pH iH pH pH 

•H rt 

I 

X 


4 N 


m 

pd- 


cz 


O iHO'i^fOr- OroO'O''Omoor^«4’<rmcx5fn<f<}-i^oorO'O0'O'(n 
O' vD-d-r-^r^oomincvioOpH<roO'DoO'Ooo'0<X)-<riHOO<Nvnir\ 
tj pH CN I — I pHpHp-4 pH iH pH pHpHpH 

rt 


II 


O HC'pHOnm<Hr^vOpHOO'0(NiO(J3pHOpHoOLncN'OoOpH'Or~rO 

IN ptpHsrvD<j-cso'mpd-oomo«d'0'(H<Nroo'-<j-oinoo<rroo'CN 

II I — I iH CN CN 1 — I rH pH CN iH pH pH CN CN 


lA 

CO pHNr00NfcnO'-<rpHp3'00 00 -d'pH<N'X)O' 0 OO''OCNpH tOOCO-d'pHNr 

pH r'.iA<DiAOO<fO'iAiAoO<no'r^CNiAr'.iA<Nr~0' "00'AiAO'>4- 

I] pH pH CN pH pH pH pH <N pH CN CN pH CN pH pH pH 


CN 

(H pHrA«jp£)cN'Dpvr'd'Cj'Cj'r^pHvDco-d’'Ct''A»d'Nrco'OpHr^o'c3'<r>cr 
CN pHooorocHNro' iTi'Oooininor^ror^ommoO'Oiri cj'-j- 

II CN pH pH pH pH pH pH CN pH CN pH pH pH 


00 

CN pHt^pHO'r'Nr'DKfcH ■►ro-ocOr-r^cN'OcNr^r'.oONrro^rcA-d-vD 
pH cniAOntOcncsOpH .~:.(HO'iAcH cAiTiO'. cnr^ro pHoocn 
II CNpHH pH pHCN pHpHCN pHCN CNpH C'JpHpH CNpH 

H 


CN iH'.0<rOmoOr^pHr^pHpH\0-cr(NCT'(NCJOCNO'C''4-«JvOpHpHr'.pHr^ 

II cH<rcNcocnmo <rHCNcniAOcNr^cNOiArocNpH<i- o<n 

tS CNCNCNpHCNpHCN pH CN CNCN CNCN CN pH CNpH 


VH OiHcNcO>d'iA^r^COO'OpHcNm<fu^\Or^COC3'OpHcNcANriA Or^OOCJ'OpHcN 
O pHpHpHpHpHpHpHpHpHpHCNCNCNCNCNCNCNCNCNCNrACArO 

M 
Q 


3-4 


238 lOA 36 89 43 80 65 160 

133 47 132 153 63 177 187 246 

220 109 106 204 140 73 138 158 

244 151 184 158 11 247 135 192 

236 227 213 174 139 169 8 73 
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Figure 3. Code Array Structure and Order of Synbol Transmission 
For Type B Interlea /ing. Where Interleaving Level = I 



di*pth iiiuul. TIujm I fiuk* wordfj malu> up tiut'li an Interleaved code block. After the 
In I orm.i t Ion aymbolfi are t ranumi 1 1 ed , the parity-check aymbola ol each Interleaved 
code word are then transmitted. 

If Interleaving.’ is ufjed, then the encoder loj’tc utructure la the aame 
US shown In Tip,. 1, except now each .I-blt shift rep, inter in replaced by an I x 
I-l)lt f.hlft rep.l*iter. An an ex.imi)le, .1 l)lock dlap.ram ol ,i RS encoder 

with Interleaving level I and p.cnerator polynomial 

P,(x) “ \\ (x-a^) 

l°l 12 

O 

where a ° 2 in GF(2 ), which in generated by .lu primitive polynomial 
x^ + X ^ + x^ + x^ + 1 , 

is ahown in Figure A. Note that a generator polynomial with symmetrical co- 
efficients is used here to save multipliers. 
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Figure 4. A Block Diagram of a (255,223) RS Encoder With 
Interleaving Level I and 
1A3 


g(x) 


n 

i°112 


(x-a^) 
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SECTION IV 


SYMBOL-SLICE VLSI RS ENCODER ARCHITECTURE 

A finite field mult Ipl lent ion io a quite eomplieuted operation. There 
are baoicully three techniqueu for Implementing, u finite field multiplication. 

The firut technique in to uue log and antilog tablet] utored In read-only memorietj 
(ROM' a) (Ref. 4). The oecont* technique iti to utje a linear teedbuck nhift regioter 
vype of ajjproach (Ih'i . 10). The thlril tiuhniciue in to une the property ol the 
trace in a finite field to form a omaller ROM It 3k-up table (Ref. 11). Due to 
the advent of LSI ROM technology, techniques 1 and 3 are usually used in an RS 
encoder deoign optimized for discrete IC's. As an example a AOO KHZ (255,233) 

RS encoder using the Berlekamp's approach (Ref. il) requires only around 40 CMOS 


When one is Interested in further drastic reduction of the power and 
size of an RS enc' 'iei for high speed applications, one has to consider VLSI imple- 
mentations. An RS encoder design optimized for discrete IC's usually does not 
have a modular structure. Hence when one uses such an architecture for VLSI lay- 
out, one has the following problems: 

(1) The design is too big to be put on one chip. 

(2) If a multichip approach is used, then one needs several chip 
designs, where each chip has an impractical number of input/ 
output pins. 

(3) The design is not modular. Therefore the design is not easy to 
adapt to other RS code parameters. 
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Hence there 1 h a need to find a Vl.SI logic Btrvicture which can alleviate the 
above i)rol)leniH. 

The repetitive architecture of the RS encoders shown in Figs. 1, 2, 
and ••' 4 , suggested that a symbol-slice type of VLSI chips, each one. consisting of 
u fixed portion of the encoder, may be cascaded to form a complete RS encoder. 

Also to reduce the VLSI chip size, RS encoders using generator polynomials with 
symmetrical coefficients are preferred. Hence we will put emphasis on this 
type of VLSI RS encoder. 

As an example, we will design a VLSI encoder chip for a 255-symbol, 
8-bit per symbol, 16-error-correcting, RS code with an interleaving level of 5. 

The primitive polynomial used is 

8 A 3 ? 

X +x + x + x' + l 

The generator polynomial for this RS code is 

1A3 

g(x) |~[ (x-a^) 

i=112 

where a = 2. The coefficients of this g(x) are given in the first column of Table 
3. The encoder logic structure for the above RS code parameter is id.ntical to 
the one sho'^m in Fig. A, except now 1 = 5. There are several ways of partitioning 
the RS encoder into four sections. One way which requires a minimum of input/ 
output pins is to include four rows of logic shown in Fig. A into one section. 

Each section is then realized by a universal VLSI RS encoder chip. Another way 
to partition the RS encoder shown in Fig. A into four sections is to include 8 
rows of logic in each coluion into one section. Each section is then realized by 
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a unlvoi aal VLSI KS iMU'odor i-hlp. Tht'ae VLSI RS cnoodor conf iono are 

d V Ibcd atJ lollowa. 

A. I VLSI US RNCODKU USING ,dL ROW PARTITIONING TKCHNIQUU 

Tlu' lo^^lc ati’iu’tiu ol the univoraal VLSI RS encoder chip uaing the 
row partitioning technique la fd»ovvfn in I’lg. 5. The entire VLSI encoder ayatem, 
wl»lch conaiHlH ol lour identical VLSI RS encoder chlpa coacadod and properly in- 
let ci'imec t ed logetlter ia ahown In Fig. b. Kaclt VLSI RS encoder chip hua 2A plna. 

A detailed deaci Ipt Ion ol the VLSI RS encoder chip and the entire VLSI RS encoder 
ayatein Its ileacrlbt a a loJiowa: 

. 1 . 1 Generator Polynomial Goellicienta Table 

Since a generator polynomial g(x) with aymmetrlcal coelllclenta la 
lined, the coell Icienta ol la al\>/aya 1. Hence there In no need lor a multiplier 
to operate im tills coei lie lent (nee Fig. 4). Gonsequently 11 the new generator 
polynomial is used, then one only needs I’./N multipliers on each VLSI chip, where 
F. Is the eiior collecting c.ipabillty ol the code .ind N la the total number oi 
clilps requlicd In a VLSI encoder system. In the design e.\ample, F “ Ib and N “ ‘'t . 
Hence •'» nm 1 1 iplieis are usetl on each VLSI RS encoder chip. I’ > make the VLSI cMp 
unlveisaT, all distinct coetllclents except 1 ol the generator polynomial are 
stored In a read-only memory on the chip. In general, an Rxd-blt table Is needed. 

In the design example K =* lb, d “ H. Hence a lb x 8-bit table Is selected. The 
outputs ol an F.x.I-bil table is led Into N, F/N-to-one multiplexors. The outputs 

ol the multiplexors are selected by log.,N ln]nit pins called the "chip .select" or 

«.> 

"G select" plna. These outputs are then led into the Inputs ol the F/N multipliers. 
In the design example two "G select" pins (pins 22 and 21) and lour, four-to-one 
multiplexors are used. The lb x 8 table and the multiplexors can easily bo 
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Encoder System 





impl emen Led by four, <'« x 8 ROM, with C, aoloct signals as the address control lines 
of each ROM. The coefficients of g(x) selected on each chip are shown in Fig. 6. 

A. 1.2 Finite Field Multipller 

Next we will discuss the architecture of the finite field multiplier. 

To connect the multiplier properly between chips and at the same time minimize the 
number i>f input /output pins used, a linear feedback shift register type of multi- 
plier (Ref. 9) rather than a ROM table look up type of multiplier (Ref, A) is 
adopted. The multiplier used is of a serial-parallel type. The logic structure 
of the multiplier is shown in Fig. 7. The J-blt generator polynomial coefficient 
is read out from the ExJ-bit ROM table and fed into the multiplier J-blt in paral- 
lel whereas the other input, generated by the feedback input (pin 2) "ANDed" with 
the feedback t>nable (pin 1), is fed into the multiplier bit-by-bit in serial. 

The output of the multiplier is loaded Into an 8-bit shift register in 
parallel at the end of every 8th bit clock (1 symbol clock time) . The parallel 
data is serialized by this shift register. The most significant bit (MSB) output 
of this shift register is added with the MSB of the AO-bit shift register, which 
Is either on the same chip or on a different chip, and the resulting data is 
shifted into the least significant bit (LSB) of the next AO-bit shift register. 

The adder is implemented by a two-input EXCLUSIVE-OR gate (there are eight 
EXCLUSIVE-OR gates on each chip). Each adder takes an input from a multiplier 
output which is either on the same chip or on a different chip, depending on the 
coefficients of the generator polynomial. 

A. 1.3 Input and Feedback Control Switches 

The switch 111 shown in Fig. A is implemented by an AND gate on the chip 
with the bit-serial data input (pin 3) and the feedback enable signal as the 
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Lwo input ». Tlu> U»i‘dbat’k I'nabli’ Hip,nal is provided by an external modul > .’*)') 
eounter whieli is driven by a eloek equal to tlie bit clock divided bv 'tO (see 
I'ip,. (>) . rhls signal Is true vjhen tlie counter is counting, t rom I to .12 1; otlier- 
wise It Is lalse. The output ol switch //I is added witli tlie MSU ol the AO-blt 
sliltl register S‘j output on tlie same chip to generate the leedb.uk output signal 
Cpln ' 3 ). rills signal is redundant in all hut the llrst chip (si>e I'lg. n) . 

I'he sv.fi tell t/2 shi3wn in Fig. 4 is imp 1 ement eil by an AND gale on the 
chip with the ieedb.ick enable and ieedback Input signals as the lv.;o inputs. The 
teedback input signals on all cb.ips (.pin 2) are connected to the Ieedback output 
signal (pin b) on the first VI, SI chip. The output of svvfitch ff'2 is led into all 
multipliers on the chip bit-by-bit in serial. 

'♦.l.-'i Input/Output Data Connections 

There are eight input/output lines on each chip. Of these eight lines, 
four lines (pins 8, 9, 11, 17) are input lines and the remaining are output lines 
(pins (3, 7, 10, 11). Pin 8 is normally connected to pin (3 on the same chip except 
for the last chip, where pin 8 is grounded. Pins 7 and ^ are normally connected 
to pins 17 and 11, respectively on the next chip except for the last chip, where 
pin 7 is connected to pin 11 on the same chip and pin 9 Is connected to pin 4 on 
the first chip. Thus one has a railroad type of data connections betvsreen chips. 

The reason for connecting pin 4 on the first chip to pin 9 on the last chip is a 
consequence of an inherent 8-bit multiplier delay. To replace tlie multiplier 
on the position by the switch 1 output, one needs to delay this output also 

bv 8 bit-, to line up the bits, For other chips besides the I irst chlii, the 8-bit 
register outputs are not used. 


4-8 



The encoder output is taken 8 bits earlier from the USB of the last 
AO-bit shift register on the first chip. Tills is because when the last bit of the 
Last symbol in the information part of the code is shifted into the encoder, the 
contents of each multiplier are loaded into the 8-bit output shift registers 
waiting to be added with the MSB of the AO-bit shift registers. These 8-bit 
symbols in the multiplier output registers actually belong to the fifth code 
word in the Interleaved code array. However the parity-check symbol of the fir.st 
code word is already being computed and now sitting 8 bits from the MSBs of each 
AO-bit shift registers. Hence the 12-blt output (pin lA) of the ^ast AO-bit 
shift register on the last chip Is the output of the VLSI RS encoder system. 

This output is taken when the external modulo 255 counter is counting from 22A 

to 255. Since at these times switches //I and IH are turned off, the entire VLSI 

encoder system is behaving like an lx2ExJ-bit (e.g., 5 x 32 x 8-blt Ip the design 

example) shift register. Thus the 5 x 32, 8-bit parity-check symbols are read 
out from the VLSI RS encoder bit-by-bit in serial and appended to the 5 x 223, 
8-bit information symbols. 

A. 2 VLSI RS ENCODER USING THE COLUMN PARTITIONING TECHNIQUE 

The logic structure of the universal VLSI RS encoder chip using the 
column partitioning technique is shown in Fig. 8. Note that this chip is very 
similar to the one shown in Fig. 5 except now one has to provide output pins to 
all multipliers and input pins to all adders on the chip. Hence this chip uses 
6 more input/output pins than the one shown in Fig. 5. The entire VLSI RS en- 
coder system using this logic structure is similar to the one shown in Fig. 6 
except now the interchip connections between adders and multipliers are of the 
pyramid type shown in Fig. 9. 
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Figure 9. Inter-Chip Connection Diagram (Column Partitioning). 


The 40-bit ohlft reglaterfi in the above two veroiona of the VLSI 
RS eacoder chips can be replaced by random access memories (RAMs) • In this c.ise, 
a write-af tcr-rcad operation should be performed during each bit Lime from the same 
location Just read out from the RAM to simulate the shift re^'ister operation. 
Consequently, this version will not operate as fast as the shift register version. 
The first advantage of using the RAM approach is that interleaving level control 
can easily be incorporated into the RAM address control logic. Of course, input 
pins must be provided to select the interleaving level. Hence a more flexible 
VLSI RS encoder chip can be obtained using the RAM approach. 

The second advantage of using the RAM approach is that a RAM cell 
usually occupies a smaller chip area than that of a static shift register. Hence 
a smaller VLSI RS encoder chip size can be obtained using the RAM approach. It 
is estimated that it is impossible to put the entire shift register version of the 
VLSI RS encoder chip design on a 235x235 mils CMOS/bulk VLSI chip using a 7 um 
f.tandard cell approach on all logic. However, if one uses custom RAM and ROM 
cells, design to implement the 40-bit static shift registers and the 16x8 table 
and 7 urn standard cell design for the rest of the logic, then it is possible to 
have a one-chip design. Of course, the entire RS encoder chip design can easily 
be put on a smaller chip if a VI. SI technology (say 3 pm standard cell approach) 
is used. 
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SECTION V 


PERFORMANCE OF THE VLSI RS 
ENCODER SYSTFJ1 

For verification, both the uhlft register version and the 

RAM version of the VLSI RS encoder chip and VLSI RS encoder system are implemented 
using discrete CMOS IC's and are now operational. The throughputs of these two 
versions are BOOK blts/sec for the shift register version and 200K bits/sec for 
the RAM version. These throughputs are expected to go much higher if the actual 
VLSI encoder chips are us'>d. 

One technique to improve t lie VI. SI RS encoding speed Is to multiplex 
the RS encoder chips. i)ne type ot multiplexing is to set tlie RS encoder chip 
interleaving level selection in the RAM approach to 1 (no interleaving) such that 
for a N-ievel of interleaving, a N-lold parallelism can be achieved. ITiis scheme 
is lllustrateii In Fig. 10. Note that innTi row ot the RS eiu'odiT I'liljis In Fig. 10 
are used to encoile .i RS t'ode won! cor responding t' the one shown in each row i>l 
the code .irrav structure In Fig. i. 

Another type ot multiplexing is to use the relationship that it 
f(x) “ (x) + f , (x) + + (x) 

then 

too „ ^ ^ . Vl’ 

g(x) “ g(x) g(x) g(x) 

can be realized by Implementing f^(x)/g(x) for i => 1 , 2 , , N in parallel and 

then summing the results from these parallel operations. Thus if one treats each 
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Figure 10. Throughput Improvement by Multiplexing VLSI RS Encoder Chips 
(Row Partitioning Version). 












f^(x) an a polynomial, wUoHe coef f ic lento arc oclocted from every N Incoming 
oymboltj otartlng from the I*"'' oymbol , where 1 ° 1, 2, N, then one can oelect 

the encoder chip interleaving level to 1 and uoe the logic otructure oirailar to 
the one ohnwn in Fig. 10 to realize t]u> t^(x)/g(x) operation. The output of each 
row of encoder chlpo In thio caue needo to be properly delayed and auiraned to 
generate the encoder output f(x)/g(x). 

Another technique to improve the Vl.SI RS encoding speed is to process 
the J-bit Incoming symbols in parallel. Parallel adders and multipliers are 
needed in this configuration. A throughput Improvement of J times cun bo 
achieved using this approach. The disadvantages of this technique are: 

(1) A lot of input/output pine are needed for the parallel data 
paths . 

(2) A larger chip size Is required to implement the Vl.SI RS encoder. 
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SECTION VI 


CONCLUSIONS 

We have Just shown tlie logic’ structure of a symbolic-slice VLSI 
RS encoder chip and the VLSI RS encoder system built by these chips. A design 
example has been given for a (255,223) VLSI RS encoder chip and VLSI RS encoder 
system. It Ivis been shown that an RS encoder consisting of four Identical CMOS 
VLSI RS encoder chips connected together may replace around AO CMOS IC's re- 
quired by an encoder design optimized for discrete IC's. Besides the size ad- 
vantage, the VLSI RS encoder also has the potential advantages of requiring less 
power and having a higher reliability. Finally, it is shown that such chips 
can easily be multiplexed to improve the encoding speed. The symbol -s 1 iced logic 
structure presented In the report could also be applied to design VLSI RS 
decoders [12]. A separate report on this subject will be provided. 
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APPENDIX 

VLSI RKKn-SOLOMON LNCODKU FOR TllK PROPOSED NASA/KSA 
TKLKHRTP.Y CHANNRI. CODINL STANDARD 
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M t iM t lu' i-omp lot ton ol t lu» VI, SI KS oiu'odor projoot, I lio following 
not ol US i iulo pai .niU'tot H Ihivo Ihhmi proponod In l ho NASA/I'.SA tolomotry ohannol 
lOilinp, st.iiuUiid (Uol . Al): 


(1) jM InilLlvo polvnonilal 


8 7 J 

X + X + X -l'X+1 


(.’) j'oiiorator polynonial 


„U. f] [x- 


I-ll, 


wlioro o satlHl los t ho ocjuat ion 
8 7 J 

X + x + x + x+ l=0. 

(3) Number of bits per symbol (J) 8 . 

(x' 4 ) Number of symbols per eode v^;ord (N) = 25'). 

(5) Number of correctable symbol errors (E) = lb. 

(b) Interleaving depth (I) ° 5. 


Note that the foregoing RS code parameters are tliose used in the 
design example In Section IV with two exceptions. These are tlte primitive 
polynomial (l.e., the field generator polynomial over tlF( 2 ) and the code gener- 
ator polynomial. These polynomials were selected by Berlekanp in an architecture 
which minimizes the discrete ICs. Reference A2 provides a detailed description 
of such an arclii tec t ure . To adapt to th's new set of "S code parameters, one 
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neiids to use a new field element table as well as a new table of generator 
polynomial coefficients. These are given in Tables A1 and A2 respectively. 

Thus, to design a VLSI RS encoder for the new code parameters, one 
only needs to replace tlie generator polynomial coefficients ROM table shown in 
Figure 5 by the coefficients given in Table A2 under the heading Also one 

needs to replace the finite field multiplier shown in Figure 7 by the multiplier 
shown in Figure Al. 
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Table Al. Elements versus exponents in a finite field GF(2 ) generated by the primitive polynomial 
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Table A2 . Generator Polynomial Coefficients for 
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Figure Al. Logic structure of a serial- parallel finite field multiplier which performs 
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